
An Optimized Sparse Approximate Matrix Multiply for Matrices with Decay


Abstract

We present an optimized single-precision implementation of the Sparse Approximate Matrix Multiply (SpAMM) [M. Challacombe and N. Bock, arXiv:1011.3534 (2010)], a fast algorithm for matrix-matrix multiplication for matrices with decay that achieves an $\mathcal{O}(n \log n)$ computational complexity with respect to matrix dimension $n$. We find that the max norm of the error achieved with a SpAMM tolerance below $2 \times 10^{-8}$ is lower than that of the single-precision SGEMM for dense quantum chemical matrices, while outperforming SGEMM with a cross-over already for small matrices ($n \sim 1000$). Relative to naive implementations of SpAMM using Intel's Math Kernel Library (MKL) or AMD's Core Math Library (ACML), our optimized version is found to be significantly faster. Detailed performance comparisons are made for quantum chemical matrices with differently structured sub-blocks. Finally, we discuss the potential of improved hardware prefetch to yield 2-3x speedups.
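The abstract does not spell out the recursion, so what follows is only a minimal Python sketch of the general SpAMM idea as it is usually described: the product is computed block-recursively, and any sub-product whose Frobenius-norm bound $\|A_{ik}\|_F \|B_{kj}\|_F$ falls below the tolerance is skipped. This is not the authors' optimized single-precision kernel. The names `frob`, `spamm`, and `spamm_multiply`, the leaf size, the power-of-two dimension, and the exponentially decaying test matrix are all assumptions made for illustration, and the sketch uses Python's double-precision floats.

```python
import math


def frob(M, r, c, n):
    """Frobenius norm of the n-by-n block of M with top-left corner (r, c)."""
    return math.sqrt(sum(M[r + i][c + j] ** 2 for i in range(n) for j in range(n)))


def spamm(A, B, C, ra, ca, rb, cb, n, tau, leaf):
    """Accumulate (block of A) @ (block of B) into C, dropping small products.

    Norms are recomputed on every call to keep the sketch short; a practical
    implementation would store them in a quadtree alongside the blocks.
    """
    # ||A_blk B_blk||_F <= ||A_blk||_F * ||B_blk||_F, so the skipped
    # contribution to any element of C is smaller than tau.
    if frob(A, ra, ca, n) * frob(B, rb, cb, n) < tau:
        return
    if n <= leaf:
        # Dense multiply of the leaf blocks.
        for i in range(n):
            for k in range(n):
                a = A[ra + i][ca + k]
                for j in range(n):
                    C[ra + i][cb + j] += a * B[rb + k][cb + j]
        return
    h = n // 2
    for i in (0, h):
        for j in (0, h):
            for k in (0, h):
                spamm(A, B, C, ra + i, ca + k, rb + k, cb + j, h, tau, leaf)


def spamm_multiply(A, B, tau, leaf=2):
    """Approximate A @ B for square matrices whose size is a power of two."""
    n = len(A)
    C = [[0.0] * n for _ in range(n)]
    spamm(A, B, C, 0, 0, 0, 0, n, tau, leaf)
    return C


# Hypothetical test matrix with exponential decay away from the diagonal.
n = 16
tau = 1e-6
A = [[math.exp(-abs(i - j)) for j in range(n)] for i in range(n)]

# Reference product from a plain triple loop.
exact = [[0.0] * n for _ in range(n)]
for i in range(n):
    for k in range(n):
        for j in range(n):
            exact[i][j] += A[i][k] * A[k][j]

approx = spamm_multiply(A, A, tau)
err = max(abs(exact[i][j] - approx[i][j]) for i in range(n) for j in range(n))
print("max-norm error:", err)
```

With `tau = 0.0` no product is ever skipped and the routine reduces to an ordinary blocked multiply. For each element of the result, the range of the summation index is covered by at most `n / leaf` skipped blocks, so in this sketch the max-norm error stays below `(n / leaf) * tau`.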


Bibliographic details

Authors: Bock, Nicolas; Challacombe, Matt
Year: 2012
Original format: PDF
Language: English

Similar documents

1. An optimized sparse approximate matrix multiply for matrices with decay [J]. Bock N., Challacombe M. SIAM Journal on Scientific Computing. 2013, No. 1
2. Semiempirical Molecular Dynamics (SEMD) I: Midpoint-Based Parallel Sparse Matrix-Matrix Multiplication Algorithm for Matrices with Decay [J]. Weber Valery, Laino Teodoro, Pozdneev Alexander. Journal of Chemical Theory and Computation (JCTC). 2015, No. 7
3. Encapsulating multiple communication-cost metrics in partitioning sparse rectangular matrices for parallel matrix-vector multiplies [J]. Ucar B., Aykanat C. SIAM Journal on Scientific Computing. 2004, No. 6
4. A Nearly-Sublinear Method for Approximating a Column of the Matrix Exponential for Matrices from Large, Sparse Networks [C]. Kyle Kloster, David F. Gleich. International Workshop on Algorithms and Models for the Web Graph. 2013
5. Matrix factorizations, triadic matrices, and modified Cholesky factorizations for optimization [D]. Fang, Haw-ren. 2006
6. GRiNCH: simultaneous smoothing and detection of topological units of genome organization from sparse chromatin contact count matrices with matrix factorization [O]. Da-Inn Lee, Sushmita Roy. 2021
7. An Optimized Sparse Approximate Matrix Multiply for Matrices with Decay [O]. Nicolas Bock, Matt Challacombe. 2013
8. Parallel sparse matrix computations: Wavefront minimization of sparse matrices. Final report for the period ending June 14, 1998 [R]. 1999
